Sambahsa


Sambahsa (IPA: [samˈbaːsa]) or Sambahsa-Mundialect is an international auxiliary language (IAL) and worldlang devised by French linguist Olivier Simon.[2] It is based on the Proto-Indo-European language (PIE) and has a relatively complex grammar.[3][4] The language was first released on the Internet in July 2007; according to its creator, he had worked on it for eight years before that. According to a study of recent auxiliary languages, "Sambahsa has an extensive vocabulary and a large amount of learning and reference material".[5]


The first part of the name of the language, Sambahsa, is composed of two words from the language itself, sam and bahsa, which mean 'same' and 'language', respectively. Mundialect, on the other hand, is a fusion of mundial 'worldwide' and dialect 'dialect'.


Sambahsa tries to preserve the original spellings of words as much as possible, which makes its orthography complex, though still regular.[6] There are four grammatical cases: nominative, accusative, dative and genitive.[7]

Though based on PIE, Sambahsa borrows a good proportion of its vocabulary from languages of other families, such as Arabic, Chinese, Indonesian, Swahili and Turkish.

Created by: Olivier Simon
Date: 2007
Purpose: Constructed language
    • Sambahsa
ISO 639-3: (sph code proposal was rejected in 2018[1])
Glottolog: None
IETF: art-x-sambahsa

Phonology

Sambahsa's phonology has little to do with Proto-Indo-European phonology, though the majority of its vocabulary comes from PIE.[8] The changes from PIE are not regular, since the creator of Sambahsa has tried to avoid homophones, which would have become common after the elimination of some PIE sounds such as laryngeals or some aspirated consonants. Unlike some auxlangs such as Esperanto, Sambahsa has an orthography that is complex yet regular and uses only the 26 letters of the basic Latin alphabet.[9] This system was chosen to preserve the recognizability of words taken from West European languages, where orthography plays a key role. For example, according to the rules of Sambahsa, bureau is pronounced as in French, and point as in English.


Sambahsa's phonemic inventory has 22 consonants and nine vowels (excluding the lengthened forms of these vowels).[10] To help language learners, and because IPA symbols cannot be typed on all keyboards, a simpler system has been developed, called Sambahsa Phonetic Transcription (SPT).


Compared to other conlangs, Sambahsa words are short, often as short as English words, and consonant-heavy.[11] The latter point is in keeping with the PIE background of Sambahsa, where roots often have a consonant-vowel-consonant structure.[12]


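Consonants

Consonant phonemes of Sambahsa

            |           | Bilabial | Dental | Alveolar | Post-alveolar | Palatal | Velar | Uvular | Glottal
Nasal       |           | m        |        | n        |               |         |       |        |
Stop        | voiceless | p        |        | t        |               |         | k     |        |
            | voiced    | b        |        | d        |               |         | g     |        |
Affricate   | voiceless |          |        | ts       | tʃ            |         |       |        |
            | voiced    |          |        | dz       | dʒ            |         |       |        |
Fricative   | voiceless | f        | θ      | s        | ʃ             | ç       | x     |        | h
            | voiced    | v        |        | z        | ʒ             |         |       | ʁ      |
Approximant |           | w        |        | l        |               | j       | (w)   |        |
Trill       |           |          |        | r        |               |         |       |        |
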
Vowels

Vowel phonemes of Sambahsa

      | Front      | Central | Back
Close | i(ː), y(ː) |         | u(ː)
Mid   | e(ː), ø(ː) | ə       | o(ː)
Open  | ɛ(ː)       | a(ː)    |

Stress 


Sambahsa's stress rules are complex but regular, and tend to follow patterns often found in German or Italian. Because stress is predictable from spelling, all words with the same orthography are pronounced and stressed in the same way. Thus, for example, while German Präsident and Italian presidente are stressed on the "ent" syllable, Sambahsa president is stressed on the "i", since president can also mean "they preside", and a final "ent" never bears the stress. This regularity of accentuation can be contrasted with English president and to preside, two words that bear the stress on different syllables, though they share the same origin.


Grammar 


Declensions 


In Sambahsa, declensions are compulsory only for pronouns. The declensions of these pronouns (demonstrative/interrogative and relative/personal) are mostly parallel, and often show similarities with their Proto-Indo-European ancestors. Thus, in all Sambahsa declensions, the neuter nominative and accusative are identical, as was the case in PIE.[13] There are identical forms for the relative and interrogative pronouns,[14] as well as for the third person pronoun and the definite article (the in English).[15]


Sambahsa has two numbers (singular and plural; the dual number of PIE has not been preserved) and four grammatical genders: masculine, feminine, neuter, and undetermined. This last gender, an innovation absent from PIE, is used when the gender of the noun referred to is uncertain or unknown and, in the plural, for groups containing elements of different genders. The creator of Sambahsa introduced this non-PIE element to avoid the "gender" dispute found in Esperanto.

Gender is attributed in Sambahsa according to the "true nature" of the noun referred to, as English speakers do with he, she and it.


Sambahsa has four grammatical cases: nominative, accusative, dative and genitive; however, their attribution tries to be as logical as possible, and not arbitrary as in many modern Indo-European languages. The nominative is the case of the subject, and the form under which words are given in dictionaries. Except for verbs describing a movement or a position (where the appropriate prepositions ought to be used), all transitive verbs must first take an accusative object before any dative one. However, the dependent clause of indirect speech counts as a direct object, so a verb can take an indirect object even when there is no visible direct object.


Compare:

• Is mi antwehrdt od is ne gwehmsiet cras, 'He answers (to) me that he won't come tomorrow'
• Is ne mi hat antwohrden, 'He hasn't answered (to) me'

In Sambahsa, all prepositions trigger the accusative.[16]

The genitive indicates possession, and is used after adjectives that can introduce a dependent clause.

Compare:

• Som yakin od is ghehdsiet kwehre to, 'I'm sure that he'll be able to do that'
• Som yakin eysen (genitive plural) imkans, 'I'm sure of his abilities'


For substantives and adjectives, there are declined "free endings" (i.e. non-compulsory ones), used most often in literary contexts for euphony or poetry. This system is inspired by the euphonic endings (ʾiʿrāb) found in Modern Standard Arabic.


Conjugation 


In Sambahsa, all verbs are regular, except ses ('to be'), habe ('to have'), and woide ('to know', in the sense of French savoir or German wissen). Dictionaries list Sambahsa verbs not under their infinitive but under their bare stem, because the whole conjugation can be deduced from the form of this stem. The main tenses of Sambahsa are present and past, but many other tenses can be obtained through the use of affixes or auxiliary verbs. Sambahsa uses the following endings, which are close to those found in many Indo-European languages.[17]

Person | Present and other tenses | Past tense only
First-person singular | -o, -m (if the verb ends with a stressed vocalic sound) or nothing (if the last vowel of the verb is unstressed) | -im
Second-person singular | -s | -(i)st(a)
Third-person singular | -t | -it
First-person plural | -m(o)s | -am
Second-person plural | -t(e) | -at
Third-person plural | -e(nt) ("-nt" is compulsory if the verb ends with a stressed vocalic sound) | -(ee)r


Sambahsa is unusual among auxlangs in using a predictable ablaut system for the past tense and passive past participles. For example, eh within a verbal stem turns to oh. Other verbs, which cannot use ablaut, can drop their nasal infix or use an improved version of De Wahl's rules. Finally, the remaining verbs simply add the past tense endings, which are optional for verbs of the categories described above.

Therefore, this system qualifies Sambahsa as a language belonging to the Indo-European family of languages, though it remains a constructed language.

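
The eh to oh ablaut mentioned above can be illustrated with a short Python sketch. It is only a minimal illustration under stated assumptions: the function name ablaut_past_stem is hypothetical, the nasal-infix and De Wahl cases are not handled, and stems containing more than one eh are left alone because the text gives no rule for them.

```python
from typing import Optional


def ablaut_past_stem(stem: str) -> Optional[str]:
    """Apply the 'eh' -> 'oh' ablaut described in the text to a verbal stem.

    Returns the past stem, or None when this rule does not apply
    (no 'eh' in the stem) or when the stem contains several 'eh'
    sequences, a case the text does not cover (assumption of this sketch).
    """
    if stem.count("eh") != 1:
        return None
    return stem.replace("eh", "oh")


if __name__ == "__main__":
    # "antwehrd" and "gwehm" are stems inferred from the example sentences
    # (antwehrdt, gwehmsiet); "skand" is a stem without "eh".
    for stem in ("antwehrd", "gwehm", "skand"):
        print(stem, "->", ablaut_past_stem(stem))
```

For instance, the stem antwehrd (seen in antwehrdt) yields antwohrd, which matches the participle antwohrden in the example sentence above.
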
Vocabulary 


Because of its rather large vocabulary for an auxlang (as of April 2021, the full Sambahsa-English dictionary contained more than 19,500 entries[18]), it is difficult to assess the share of each language in Sambahsa's eclectic wordstock. However, the main layers are Indo-European vocabulary (either reconstructed or extrapolated), Greco-Roman scientific and technical vocabulary (which is not discussed below, as it is more or less comparable to what is found in English) and multiple sources extending from Western Europe to Eastern Asia.


Indo-European vocabulary 


The core of Sambahsa's vocabulary is undoubtedly of Indo-European origin. Only a few Sambahsa words can be traced back to pre-Indo-European times (like kamwns, 'chamois', cf. Basque: ahuntz). Many basic Sambahsa words are thus very close to their reconstructed Indo-European counterparts. See (Sambahsa/Proto-Indo-European): eghi/*H₁eghis ('hedgehog'), ghelgh/*ghelghe- ('gland'), pehk/*pek ('to comb'), skand/*skand ('to jump'), peungst/*pnkʷsti- ('fist'), wobhel/*wobhel- ('weevil'), gwah/*gweH₂ ('to go'), tox/*toksom ('yew wood' in Sambahsa; 'yew' in PIE), treb/*trébs ('dwelling'), oit/*H₁óitos ('oath'), poti/*potis ('Sir, lord'). But less well attested Indo-European vocabulary is found in Sambahsa too. For example, the common Sambahsa word for person is anghen, as in semanghen, 'someone, somebody', and can be derived from PIE ?*H₂enH₁gh, only found in Old Armenian anjn ('person') and Old Norse angi ('smell'). And motic ('hoe') may be a cognate of Old Church Slavonic motyka and English mattock.


Further development from the Indo-European background 


Though Sambahsa, like any other conlang, has derivation rules, it sometimes uses back-formation too. For example, the relation between Lithuanian bendras ('companion'), Ancient Greek pentheros ('father-in-law') and Sanskrit bandhu- ('companion') is uncertain;[19] however, Sambahsa "reconstructs" this root as behndwr, from behnd 'to bind'. PIE has *dhéghom 'earth' and *dhinégh- (with nasal infix) 'to shape, to make pottery'; accordingly, Sambahsa has (di)ghom and dinegh, but the latter can be understood as "to put earth on" by analogy with yug ('yoke') and yuneg ('to join'), from PIE *yugom and *yunég- respectively.


The Sambahsa word for 'ice pellet' is kersnit; it rests on the word kersen 'frozen snow', itself from Old Norse hjarn, Lithuanian šarma ('frost') and Russian serén.[20] The suffix -it, however, was abstracted from PIE words like *sepit 'grain of wheat' and *Helbit 'grain of barley';[21] thus kersnit can be understood as 'a grain of frozen snow'.


Words common to different language families 


A characteristic of Sambahsa is to include words found in different language families, while the most famous auxiliary languages tend to limit themselves to a compilation of Romance vocabulary with some borrowings from the Germanic languages. For example:


• Schkaf ('cupboard') has cognates both in Germanic and Slavic languages: Russian шкаф, Polish szafa, Ukrainian шафа, Danish skab, Icelandic skápur, Franconian dialect schaaf and Swedish skåp.

• Graf ('count', as a nobility title) is a German word from Greek grapheus that has been borrowed into many languages including Azerbaijani graf, Bulgarian граф, Czech hrabě, Danish greve, Estonian krahv, Croatian grof, Hungarian gróf, Finnish kreivi, Lithuanian grafas, Icelandic greifi and Russian граф.

• Bicair ('mug') is found in German Becher and many other Germanic languages. It comes from Low Latin bicarium and is at the origin of Hungarian pohár, Italian bicchiere and Romanian pahar, all meaning 'glass'.

• Saray means 'big hall, palace' and has the same Turkish and Persian origin as English seraglio, but with a meaning closer to its etymology and to Russian сарай ('barn').


The Balkan sprachbund 


Though they belong to different language families, the languages spoken in Southeast Europe share a number of grammatical features and loanwords owing to their common history. That is why Sambahsa includes words from this region.


• Schut ('hornless') corresponds to Romanian șut, Bulgarian/Serbo-Croatian šut; also Albanian shut 'hornless'.

• Potire ('pitcher') comes from Ancient Greek ποτήρ, like Serbo-Croatian путир, Russian потир, Romanian and Albanian potir.

• Keramide ('coating') comes from Greek κεραμίδα, which has given, among others, Romanian cărămidă ('brick') and Arabic قرميدة qirmīda(t) 'tile'.


Words from Arabic and Persian 


A significant part of Sambahsa's vocabulary comes from Arabic and Persian. Both languages have extensively provided loanwords to a lexical continuum ranging from the Atlantic Ocean to Indonesia, owing respectively to the spread of Islam and the brilliance of the former Persian civilization. Sambahsa learning materials often call this stratum "Muslim".


• Amlak ('assets') comes from Arabic أملاك and is found in Turkish emlak ('estate') and Persian املاک.

• Zina ('adultery') comes from Arabic زنا and is found in Persian and many other languages whose speakers are mostly Muslim.

• Adarb ('merlon') comes from Spanish adarve and Portuguese adarve, from Arabic درب and ultimately Persian در, which has its origin in PIE *dʰwer-, like Sambahsa dwer ('door').

Sinitic vocabulary

Classical Chinese has heavily influenced the wordstock of neighbouring languages, mostly Japanese, Korean and Vietnamese. As a result, Sambahsa incorporates some Sinitic vocabulary, but the phonetic differences between these various languages can be considerable.


• Kjingyow ('goldfish') corresponds to 金魚, which is read jīnyú in Mandarin Pinyin and kingyo in Japanese.

• Geong ('fortified palace') corresponds to the Han character 城, read chéng in Mandarin Pinyin, jō in Japanese Go-on reading, seong in Korean, and thành in Vietnamese.

• Ryienrlwey ('humankind') is an attempt to balance Mandarin rénlèi, Japanese jinrui, Korean illyu, and other renderings of 人類. This word illustrates an apparent drawback of the language's approach of trying to evoke all the targeted languages at once.


Not all Sambahsa Sinitic words come from Classical Chinese. The Min Nan language of Southern China provided loanwords to some South-East Asian languages, and some of these borrowings are, in turn, found in Sambahsa.

• Sambahsa pangsit ('wonton') is an Indonesian word from Min Nan pian sit, while Mandarin Chinese (Pinyin) has húndùn.

• Likewise, Sambahsa loteng ('attic') comes from Min Nan lauteng through Indonesian loteng.


Sample phrases 


Sambahsa | English
Sellamat! | Hello!
Kam leitte yu? | How are you?
Leito. | Fine.
Bahte yu Sambahsa? | Do you speak Sambahsa?
No, ne bahm Sambahsa. | No, I don't speak Sambahsa.
Marba! | Pleased to meet you!

Literary works translated into Sambahsa


• The Songs of Bilitis by Pierre Louÿs: Ia Songvs as Bilitis (https://fr.scribd.com/doc/83065380/la-Songvs-as-Bilitis)

• Demian by Hermann Hesse: Demian (https://fr.scribd.com/doc/122334083/Demian-Sambahsa-tarjem)

• The Stranger by Albert Camus: Is Gospoti (https://fr.scribd.com/doc/81520739/ls-Gospoti-l-Etranger-in-Sambahsa)

• The Little Prince by Antoine de Saint-Exupéry: Is Lytil Prince (https://fr.scribd.com/doc/61098743/Is-Lytil-Prince-Le-Petit-Prince-en-Sambahsa-The-Little-Prince-in-Sambahsa)

• The Gospel of Matthew: Id Euanghelio sekwent Matyah (https://fr.scribd.com/doc/147341646/EUANGHELIO-SEKWENT-MATYAH)

• Alice's Adventures in Wonderland: Ia Aventures as Alice in Daumsenland (http://www.evertype.com/books/alice-sambahsa.html) and Through the Looking-Glass (http://www.evertype.com/books/looking-glass-sambahsa.html), published by Evertype

• The Strange Case of Doctor Jekyll and Mister Hyde: Id Stragno Fall om Doctor Jekyll ed Poti Hyde (https://www.evertype.com/books/jekyll-sambahsa.html), published by Evertype

• Un Coeur simple by Gustave Flaubert: Un simple kerd (https://fr.scribd.com/document/380965322/Un-Simple-Kerd)

• The Tower of the Elephant, The Scarlet Citadel, The Devil in Iron, A Witch Shall Be Born, Jewels of Gwahlur and Shadows in Zamboula by Robert E. Howard: Id Tor ios Elephant (https://fr.scribd.com/document/334053099/ld-Tor-los-Elephant), Id Scarlato Citadell (https://fr.scribd.com/document/335501303/ld-Scarlato-Citadell), Diabel ex Sider (https://fr.scribd.com/document/383775991/Diabel-Ex-Sider), Gnahsiet un Yasa (https://fr.scribd.com/document/580718626/Gnahsiet-Un-Yasa), Ia Dents os Gwahlur (https://fr.scribd.com/document/611394629/la-Dents-Os-Gwahlur), Zamboulas Anthropophags (https://fr.scribd.com/document/638702165/Zamboulas-Anthropophags)

• A Scandal in Bohemia by Arthur Conan Doyle: Un Scandal in Chekhia (https://fr.scribd.com/document/469774223/Scandal-in-Chekhia)

• The Call of Cthulhu and The Moon-Bog by H. P. Lovecraft: Kal os Cthulhu (https://fr.scribd.com/document/558075126/Kal-Os-Cthulhu), Luna-moor (https://fr.scribd.com/document/558075759/Luna-moor)

Movies with Sambahsa subtitles


• Revelations (a fan-made movie based on Star Wars): Revelations (https://www.youtube.com/watch?v=m-qjqQwFOBc)

• The Hunt for Gollum (a fan-made prequel to The Lord of the Rings): Sayd po Gollum (https://www.youtube.com/watch?v=GkAxzTfcMbU)

• Born of Hope (a fan-made prequel to The Lord of the Rings): Gnaht Speh (https://www.youtube.com/watch?v=cY4aAtSf8el)

• Home (a French movie by Yann Arthus-Bertrand about environmental threats): Ghom (https://www.dailymotion.com/playlist/x1ikyf_3abductee_ghom-in-sambahsa-hd/1#video=xgb2g3)

• Kaydara (a fan-made movie based on The Matrix): Kaydara (https://www.dailymotion.com/video/xickar_kaydara-official-film_shortfilms)

References

"The strange quest for a universal "Earth Standard" language" by Esther Inglis-Arkell, 08-17-2012: http://io9.com/5935563/the-strange-quest-for-a-universal-earth-standard-language

However, different versions of pronunciation of "r" are admitted, and the "ng" sound (as in English "sing") could be counted as a new sound, distinct from the conjunction of [n] + [g].

See this link on a French-speaking forum: http://aphil.forumn.net/t844p15-analyse-phonotactique-kotava-esperanto-uropi-et-autres?highlight=analyse+phon%E9tique

R.S.P. Beekes, Comparative Indo-European Linguistics, J. Benjamins Pub., p. 195

With the exception of the nominative singular masculine, as in Latin, where the relative pronoun is qui, and the interrogative form is quis.

But the genitive form serves only for the definite article, while the possessive pronouns have special forms (otherwise, confusions could have arisen).

Under certain circumstances, the preposition bi can merge with the definite article in its dative form.

They can be compared to the data provided in Indo-European Linguistics: An Introduction by J. Clackson, Cambridge University Press, 2007, pp. 127 & 128.

"Sambahsa English Dictionary - Nature" (https://www.scribd.com/doc/39063111/Sambahsa-English-Dictionary). Scribd. Retrieved 27 June 2017.

J.P. Mallory & D.Q. Adams, Encyclopedia of Indo-European Culture, Fitzroy Dearborn Publishers, p. 196

ibidem, p. 287

ibidem, p. 639